Overview

Dataset Statistics

Number of Variables 21
Number of Rows 7043
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 7.8 MB
Average Row Size in Memory 1.1 KB
Variable Types
  • Categorical: 19
  • Numerical: 2

Dataset Insights

tenure is skewed Skewed
MonthlyCharges is skewed Skewed
customerID has a high cardinality: 7043 distinct values High Cardinality
TotalCharges has a high cardinality: 6531 distinct values High Cardinality
customerID has constant length 10 Constant Length
SeniorCitizen has constant length 1 Constant Length
customerID has all distinct values Unique

Variables


customerID

categorical

Approximate Distinct Count 7043
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Memory Size 528225

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 7590-VHVEG
2nd row 5575-GNVDE
3rd row 3668-QPYBK
4th row 7795-CFOCW
5th row 9237-HQITU

Letter

Count 35215
Lowercase Letter 0
Space Separator 0
Uppercase Letter 35215
Dash Punctuation 7043
Decimal Number 28172
  • customerID contains many words: 7043 words
  • customerID has words of constant length

gender

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 492943

Length

Mean 4.9905
Standard Deviation 1
Median 4
Minimum 4
Maximum 6

Sample

1st row Female
2nd row Male
3rd row Male
4th row Male
5th row Female

Letter

Count 35148
Lowercase Letter 28105
Space Separator 0
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Male, Female) take over 50.0%

SeniorCitizen

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 464838
  • The largest value (0) is over 5.17 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 7043
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 5.17 times larger than the second largest value (1)
  • SeniorCitizen has words of constant length

Partner

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 475283

Length

Mean 2.483
Standard Deviation 0.4997
Median 2
Minimum 2
Maximum 3

Sample

1st row Yes
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 17488
Lowercase Letter 10445
Space Separator 0
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%

Dependents

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 473991
  • The largest value (No) is over 2.34 times larger than the second largest value (Yes)

Length

Mean 2.2996
Standard Deviation 0.4581
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 16196
Lowercase Letter 9153
Space Separator 0
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 2.34 times larger than the second largest value (yes)

tenure

numerical

Approximate Distinct Count 73
Approximate Unique (%) 1.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 112688
Mean 32.3711
Minimum 0
Maximum 72
Zeros 11
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • tenure is skewed right (γ1 = 0.2395)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 9
Median 29
Q3 55
95-th Percentile 72
Maximum 72
Range 72
IQR 46

Descriptive Statistics

Mean 32.3711
Standard Deviation 24.5595
Variance 603.1681
Sum 227990
Skewness 0.2395
Kurtosis -1.3872
Coefficient of Variation 0.7587
  • tenure is not normally distributed (p-value 2.8308483309475236e-12)

PhoneService

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 478242
  • The largest value (Yes) is over 9.33 times larger than the second largest value (No)

Length

Mean 2.9032
Standard Deviation 0.2958
Median 3
Minimum 2
Maximum 3

Sample

1st row No
2nd row Yes
3rd row Yes
4th row No
5th row Yes

Letter

Count 20447
Lowercase Letter 13404
Space Separator 0
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Yes, No) take over 50.0%
  • The largest value (yes) is over 9.33 times larger than the second largest value (no)

MultipleLines

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 484400

Length

Mean 3.7775
Standard Deviation 4.0304
Median 3
Minimum 2
Maximum 16

Sample

1st row No phone service
2nd row No
3rd row No
4th row No phone service
5th row No

Letter

Count 25241
Lowercase Letter 18198
Space Separator 1364
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%

InternetService

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 502166

Length

Mean 6.3
Standard Deviation 4.1788
Median 3
Minimum 2
Maximum 11

Sample

1st row DSL
2nd row DSL
3rd row DSL
4th row DSL
5th row Fiber optic

Letter

Count 41275
Lowercase Letter 29390
Space Separator 3096
Uppercase Letter 11885
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Fiber optic, DSL) take over 50.0%

OnlineSecurity

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 499842
  • The largest value (No) is over 1.73 times larger than the second largest value (Yes)

Length

Mean 5.97
Standard Deviation 6.8665
Median 3
Minimum 2
Maximum 19

Sample

1st row No
2nd row Yes
3rd row Yes
4th row Yes
5th row No

Letter

Count 38995
Lowercase Letter 31952
Space Separator 3052
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 2.49 times larger than the second largest value (yes)

OnlineBackup

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 500252

Length

Mean 6.0283
Standard Deviation 6.8368
Median 3
Minimum 2
Maximum 19

Sample

1st row Yes
2nd row No
3rd row Yes
4th row No
5th row No

Letter

Count 39405
Lowercase Letter 32362
Space Separator 3052
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 1.9 times larger than the second largest value (yes)

DeviceProtection

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 500245

Length

Mean 6.0273
Standard Deviation 6.8373
Median 3
Minimum 2
Maximum 19

Sample

1st row No
2nd row Yes
3rd row No
4th row Yes
5th row No

Letter

Count 39398
Lowercase Letter 32355
Space Separator 3052
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 1.91 times larger than the second largest value (yes)

TechSupport

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 499867
  • The largest value (No) is over 1.7 times larger than the second largest value (Yes)

Length

Mean 5.9736
Standard Deviation 6.8648
Median 3
Minimum 2
Maximum 19

Sample

1st row No
2nd row No
3rd row No
4th row Yes
5th row No

Letter

Count 39020
Lowercase Letter 31977
Space Separator 3052
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 2.45 times larger than the second largest value (yes)

StreamingTV

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 500530

Length

Mean 6.0677
Standard Deviation 6.8163
Median 3
Minimum 2
Maximum 19

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 39683
Lowercase Letter 32640
Space Separator 3052
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 1.6 times larger than the second largest value (yes)

StreamingMovies

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 500555

Length

Mean 6.0713
Standard Deviation 6.8144
Median 3
Minimum 2
Maximum 19

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 39708
Lowercase Letter 32665
Space Separator 3052
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 1.58 times larger than the second largest value (yes)

Contract

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 537389
  • The largest value (Month-to-month) is over 2.29 times larger than the second largest value (Two year)

Length

Mean 11.3012
Standard Deviation 2.9851
Median 14
Minimum 8
Maximum 14

Sample

1st row Month-to-month
2nd row One year
3rd row Month-to-month
4th row One year
5th row Month-to-month

Letter

Count 68676
Lowercase Letter 61633
Space Separator 3168
Uppercase Letter 7043
Dash Punctuation 7750
Decimal Number 0
  • The top 2 categories (Month-to-month, Two year) take over 50.0%

PaperlessBilling

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 476052

Length

Mean 2.5922
Standard Deviation 0.4915
Median 3
Minimum 2
Maximum 3

Sample

1st row Yes
2nd row No
3rd row Yes
4th row No
5th row Yes

Letter

Count 18257
Lowercase Letter 11214
Space Separator 0
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Yes, No) take over 50.0%

PaymentMethod

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 588585

Length

Mean 18.5702
Standard Deviation 5.0404
Median 16
Minimum 12
Maximum 25

Sample

1st row Electronic check
2nd row Mailed check
3rd row Mailed check
4th row Bank transfer (aut...
5th row Electronic check

Letter

Count 114549
Lowercase Letter 107506
Space Separator 10109
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Electronic check, Mailed check) take over 50.0%

MonthlyCharges

numerical

Approximate Distinct Count 1585
Approximate Unique (%) 22.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 112688
Mean 64.7617
Minimum 18.25
Maximum 118.75
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • MonthlyCharges is skewed left (γ1 = -0.2205)

Quantile Statistics

Minimum 18.25
5-th Percentile 19.65
Q1 35.5
Median 70.35
Q3 89.85
95-th Percentile 107.4
Maximum 118.75
Range 100.5
IQR 54.35

Descriptive Statistics

Mean 64.7617
Standard Deviation 30.09
Variance 905.4109
Sum 456116.6
Skewness -0.2205
Kurtosis -1.2572
Coefficient of Variation 0.4646
  • MonthlyCharges is not normally distributed (p-value 4.761511582207611e-14)

TotalCharges

categorical

Approximate Distinct Count 6531
Approximate Unique (%) 92.7%
Missing 0
Missing (%) 0.0%
Memory Size 499190

Length

Mean 5.8775
Standard Deviation 1.0213
Median 6
Minimum 1
Maximum 7

Sample

1st row 29.85
2nd row 1889.5
3rd row 108.15
4th row 1840.75
5th row 151.65

Letter

Count 0
Lowercase Letter 0
Space Separator 11
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 34676
  • TotalCharges contains many words: 6432 words

Churn

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 473750
  • The largest value (No) is over 2.77 times larger than the second largest value (Yes)

Length

Mean 2.2654
Standard Deviation 0.4416
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row Yes
4th row No
5th row Yes

Letter

Count 15955
Lowercase Letter 8912
Space Separator 0
Uppercase Letter 7043
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 2.77 times larger than the second largest value (yes)

Interactions

Correlations

Missing Values